Junqi Zhao

On the Difficulty of Learning a Meta-network for Training Data Selection

May 30, 2026
Via arXiv

Are LLMs Smarter Than Chimpanzees? An Evaluation on Perspective Taking and Knowledge State Estimation

Jan 18, 2026
Via arXiv

Region-Specific Audio Tagging for Spatial Sound

Sep 11, 2025
Via arXiv

AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion

May 28, 2025
Via arXiv

BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference

Oct 30, 2024
Via arXiv

Universal Sound Separation with Self-Supervised Audio Masked Autoencoder

Jul 16, 2024
Via arXiv

What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases

Apr 03, 2024
Via arXiv

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

May 11, 2023
Via arXiv

Applying Incremental Deep Neural Networks-based Posture Recognition Model for Injury Risk Assessment in Construction

Aug 04, 2020
Via arXiv